Using Support Vector Machines for Multicategory Cancer Diagnosis Based on Gene Expression Data

نویسندگان

  • Alexander Statnikov
  • Constantin F. Aliferis
  • Ioannis Tsamardinos
چکیده

In an effort to contribute to the development of accurate cancer diagnosis based on gene expression data, this study performs a comprehensive evaluation of multicategory Support Vector Machine (MC-SVM) algorithms applied to the majority of cancer-related gene expression microarray datasets currently freely available to the scientific community. Our results show that: (a) MC-SVMs are very effective in performing accurate cancer diagnosis in high-dimensional gene expression data. The MC-SVM techniques by Crammer and Singer, Weston and Watkins, and one-versus-rest are the best methods in this domain. (b) MC-SVMs outperform other popular machine learning algorithms to a remarkable degree. (c) A prototype software tool which develops MC-SVM classifiers in a fully-automated fashion is introduced. Results produced by the tool compare favorably with previously published studies.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Classification of Multiple Cancer Types by Multicategory Support Vector Machines Using Gene Expression Data

MOTIVATION High-density DNA microarray measures the activities of several thousand genes simultaneously and the gene expression profiles have been used for the cancer classification recently. This new approach promises to give better therapeutic measurements to cancer patients by diagnosing cancer types with improved accuracy. The Support Vector Machine (SVM) is one of the classification method...

متن کامل

A comprehensive evaluation of multicategory classification methods for microarray gene expression cancer diagnosis

MOTIVATION Cancer diagnosis is one of the most important emerging clinical applications of gene expression microarray technology. We are seeking to develop a computer system for powerful and reliable cancer diagnostic model creation based on microarray data. To keep a realistic perspective on clinical applications we focus on multicategory diagnosis. To equip the system with the optimum combina...

متن کامل

Feature Selection and Classification of Microarray Gene Expression Data of Ovarian Carcinoma Patients using Weighted Voting Support Vector Machine

We can reach by DNA microarray gene expression to such wealth of information with thousands of variables (genes). Analysis of this information can show genetic reasons of disease and tumor differences. In this study we try to reduce high-dimensional data by statistical method to select valuable genes with high impact as biomarkers and then classify ovarian tumor based on gene expression data of...

متن کامل

A Comparison of SVM-based Evolutionary Methods for Multicategory Cancer Diagnosis using Microarray Gene Expression Data

Selection of relevant genes that will give higher accuracy for sample classification (for example, to distinguish cancerous from normal tissues) is a common task in most microarray data studies. An evolutionary method based on generalization error bound theory of support vector machine (SVM) can select a subset of potentially informative genes for SVM classifier very efficiently. The bound theo...

متن کامل

Gene Identification from Microarray Data for Diagnosis of Acute Myeloid and Lymphoblastic Leukemia Using a Sparse Gene Selection Method

Background: Microarray experiments can simultaneously determine the expression of thousands of genes. Identification of potential genes from microarray data for diagnosis of cancer is important. This study aimed to identify genes for the diagnosis of acute myeloid and lymphoblastic leukemia using a sparse feature selection method. Materials and Methods: In this descriptive study, the expressio...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003